AITopics | efficient design

Fast Vision Transformers with HiLo Attention

Neural Information Processing Systems | Dec-24-2025, 07:10:26 GMT

Vision Transformers (ViTs) have triggered the most recent and significant breakthroughs in computer vision. Their efficient designs are mostly guided by an indirect metric of computational complexity, namely FLOPs, which shows a clear gap from direct metrics such as throughput. We therefore propose using direct speed evaluation on the target platform as the design principle for efficient ViTs. In particular, we introduce LITv2, a simple and effective ViT that performs favourably against existing state-of-the-art methods across a spectrum of model sizes while running faster. At the core of LITv2 is a novel self-attention mechanism, which we dub HiLo. HiLo is inspired by the insight that high frequencies in an image capture local fine details while low frequencies capture global structures, whereas a multi-head self-attention layer ignores the characteristics of different frequencies.
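As a rough illustration of what "direct speed evaluation on the target platform" can mean in practice, the sketch below times a workload and reports items per second. It is not the authors' benchmarking code: `measure_throughput`, `dummy_model_batch`, the batch size, and the warm-up and iteration counts are all hypothetical names and values chosen for this example.

```python
import time


def measure_throughput(run_batch, batch_size, warmup=3, iters=10):
    """Return items processed per second for run_batch on this machine."""
    for _ in range(warmup):
        run_batch()  # warm-up runs are excluded from the timing
    start = time.perf_counter()
    for _ in range(iters):
        run_batch()
    elapsed = time.perf_counter() - start
    return batch_size * iters / elapsed


def dummy_model_batch():
    # Hypothetical stand-in for one forward pass over a batch of images.
    return sum(i * i for i in range(10_000))


images_per_second = measure_throughput(dummy_model_batch, batch_size=64)
print(f"{images_per_second:.1f} images/s")
```

Two models with similar FLOP counts can give different numbers here, because the measured time also reflects memory access and how well the operations map onto the hardware.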

fast vision transformer, name change, vision transformer, (11 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Vision (0.98)

RTFormer: Efficient Design for Real-Time Semantic Segmentation with Transformer

Neural Information Processing Systems | Oct-10-2024, 13:20:49 GMT

Recently, transformer-based networks have shown impressive results in semantic segmentation. Yet for real-time semantic segmentation, pure CNN-based approaches still dominate, owing to the time-consuming computation mechanism of transformers. We propose RTFormer, an efficient dual-resolution transformer for real-time semantic segmentation, which achieves a better trade-off between performance and efficiency than CNN-based models. To achieve high inference efficiency on GPU-like devices, RTFormer leverages GPU-Friendly Attention with linear complexity and discards the multi-head mechanism. In addition, we find that cross-resolution attention gathers global context information for the high-resolution branch more efficiently by spreading the high-level knowledge learned from the low-resolution branch. Extensive experiments on mainstream benchmarks demonstrate the effectiveness of RTFormer: it achieves state-of-the-art results on Cityscapes, CamVid and COCOStuff, and shows promising results on ADE20K.

efficient design, real-time semantic segmentation, rtformer, (3 more...)

Neural Information Processing Systems

Technology:

Information Technology > Architecture > Real Time Systems (0.92)
Information Technology > Artificial Intelligence (0.71)

A Method for the Efficient Design of Boltzmann Machines for Classification Problems

Neural Information Processing Systems | Apr-6-2023, 19:42:14 GMT

We introduce a method for the efficient design of a Boltzmann machine (or a Hopfield net) that computes an arbitrary given Boolean function f. This method is based on an efficient simulation of acyclic circuits with threshold gates by Boltzmann machines. As a consequence, we can show that various concrete Boolean functions f that are relevant for classification problems can be computed by scalable Boltzmann machines that are guaranteed to converge to their global maximum configuration with high probability after constantly many steps.

boltzmann machine, classification problem, efficient design

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)

Best Research Papers on Vision Transformers

#artificialintelligence | Jun-1-2022, 02:29:33 GMT

Abstract: Vision Transformers (ViTs) have triggered the most recent and significant breakthroughs in computer vision. Their efficient designs are mostly guided by an indirect metric of computational complexity, namely FLOPs, which shows a clear gap from direct metrics such as throughput. We therefore propose using direct speed evaluation on the target platform as the design principle for efficient ViTs. In particular, we introduce LITv2, a simple and effective ViT that performs favourably against existing state-of-the-art methods across a spectrum of model sizes while running faster. At the core of LITv2 is a novel self-attention mechanism, which we dub HiLo.

best research paper, transformer, vision transformer, (12 more...)

#artificialintelligence

Industry: Information Technology > Security & Privacy (0.31)

Technology: Information Technology > Artificial Intelligence > Vision (1.00)

Adaptive Task Planning for Large-Scale Robotized Warehouses

Shi, Dingyuan, Tong, Yongxin, Zhou, Zimu, Xu, Ke, Tan, Wenzhe, Li, Hongbo

arXiv.org Artificial Intelligence | Apr-24-2022

Robotized warehouses are deployed to automatically distribute millions of items brought by the massive logistic orders from e-commerce. A key to automated item distribution is to plan paths for robots, also known as task planning, where each task is to deliver racks with items to pickers for processing and then return the racks. Prior solutions are unfit for large-scale robotized warehouses because they are inflexible to time-varying item arrivals and inefficient at high throughput. In this paper, we propose a new task planning problem called TPRW, which aims to minimize the end-to-end makespan that incorporates the entire item distribution pipeline, known as a fulfillment cycle. Direct extensions of state-of-the-art path finding methods are ineffective for the TPRW problem because they fail to adapt to the bottleneck variations of fulfillment cycles. In response, we propose Efficient Adaptive Task Planning, a framework for large-scale robotized warehouses with time-varying item arrivals. It adaptively selects racks to fulfill at each timestamp via reinforcement learning, accounting for the time-varying bottleneck of the fulfillment cycles. Then it finds paths for robots to transport the selected racks. The framework adopts a series of efficient optimizations on both time and memory to handle large-scale item throughput. Evaluations on both synthesized and real data show an improvement of $37.1\%$ in effectiveness and $75.5\%$ in efficiency over the state of the art.

algorithm, artificial intelligence, robot, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICDE53745.2022.00314

2205.00831

Country: Asia (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Transportation > Freight & Logistics Services (0.68)
Energy > Oil & Gas (0.54)
Transportation > Ground > Road (0.46)
Information Technology > Services > e-Commerce Services (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)

Efficient Designs of SLOPE Penalty Sequences in Finite Dimension

Zhang, Yiliang, Bu, Zhiqi

arXiv.org Machine Learning | Feb-16-2021

In linear regression, SLOPE is a new convex analysis method that generalizes the Lasso via the sorted L1 penalty: larger fitted coefficients are penalized more heavily. This magnitude-dependent regularization requires a penalty sequence $\lambda$ as input, instead of a scalar penalty as in the Lasso case, which makes the design extremely expensive to compute. In this paper, we propose two efficient algorithms to design the possibly high-dimensional SLOPE penalty in order to minimize the mean squared error. For Gaussian data matrices, we propose a first-order Projected Gradient Descent (PGD) under the Approximate Message Passing regime. For general data matrices, we present a zeroth-order Coordinate Descent (CD) to design a sub-class of SLOPE, referred to as the k-level SLOPE. Our CD allows a useful trade-off between accuracy and computation speed. We demonstrate the performance of SLOPE with our designs via extensive experiments on synthetic data and real-world datasets.
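To make the sorted L1 penalty concrete, the sketch below evaluates it for a small coefficient vector, assuming the standard definition in which the i-th largest coefficient magnitude is multiplied by the i-th entry of a non-increasing, non-negative sequence $\lambda$. The function name `sorted_l1_penalty` and the example numbers are illustrative only; this is not the paper's PGD or CD design algorithm.

```python
def sorted_l1_penalty(beta, lam):
    """Sorted L1 penalty: sum_i lam[i] * (i-th largest |beta|)."""
    if len(beta) != len(lam):
        raise ValueError("beta and lam must have the same length")
    if any(l < 0 for l in lam):
        raise ValueError("lam must be non-negative")
    if any(lam[i] < lam[i + 1] for i in range(len(lam) - 1)):
        raise ValueError("lam must be non-increasing")
    # Pair the largest penalty with the largest coefficient magnitude.
    magnitudes = sorted((abs(b) for b in beta), reverse=True)
    return sum(l * m for l, m in zip(lam, magnitudes))


beta = [0.5, -3.0, 1.0]    # illustrative coefficients
slope_lam = [3.0, 2.0, 1.0]  # magnitude-dependent penalty sequence
lasso_lam = [2.0, 2.0, 2.0]  # constant sequence: reduces to the Lasso penalty

print(sorted_l1_penalty(beta, slope_lam))  # 3*3.0 + 2*1.0 + 1*0.5 = 11.5
print(sorted_l1_penalty(beta, lasso_lam))  # 2 * (3.0 + 1.0 + 0.5) = 9.0
```

With a constant sequence the penalty is the ordinary L1 norm scaled by a single number, which is why the Lasso needs only a scalar while SLOPE needs a whole sequence to be designed.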

efficient design, projection, slope penalty sequence, (10 more...)

arXiv.org Machine Learning

2102.07211

Country:

Europe > Russia (0.04)
Asia > Russia (0.04)
North America > United States > Pennsylvania (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Health & Medicine > Therapeutic Area (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

A Method for the Efficient Design of Boltzmann Machines for Classification Problems

Gupta, Ajay, Maass, Wolfgang

Neural Information Processing Systems | Dec-31-1991

A Boltzmann machine ([AHS], [HS], [AK]) is a neural network model in which the units update their states according to a stochastic decision rule. It consists of a set $U$ of units, a set $C$ of unordered pairs of elements of $U$, and an assignment of connection strengths $S: C \to \mathbb{R}$. A configuration of a Boltzmann machine is a map $k: U \to \{0, 1\}$.
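The definition above can be sketched as a small data structure: units, unordered pairs with real-valued strengths, and a 0/1 configuration. The excerpt does not spell out the stochastic decision rule, so the sketch assumes the commonly used logistic rule with a temperature parameter and no bias terms; the function names, the three-unit example, and the temperature value are illustrative, not taken from the paper.

```python
import math
import random


def net_input(unit, strengths, config):
    """Sum of connection strengths from `unit` to its currently active neighbours."""
    total = 0.0
    for pair, strength in strengths.items():
        if unit in pair:
            (other,) = pair - {unit}  # the other endpoint of the unordered pair
            total += strength * config[other]
    return total


def update_unit(unit, strengths, config, temperature, rng):
    """Assumed logistic rule: switch `unit` on with probability sigmoid(net / T)."""
    p_on = 1.0 / (1.0 + math.exp(-net_input(unit, strengths, config) / temperature))
    config[unit] = 1 if rng.random() < p_on else 0
    return p_on


# Connection strengths S on unordered pairs C of units from U = {a, b, c}.
strengths = {frozenset({"a", "b"}): 2.0, frozenset({"b", "c"}): -1.0}
config = {"a": 1, "b": 0, "c": 1}  # a configuration k: U -> {0, 1}

rng = random.Random(0)
p_b = update_unit("b", strengths, config, temperature=1.0, rng=rng)
print(f"P(b on) = {p_b:.3f}, new state of b = {config['b']}")
```

Using `frozenset` keys reflects that the pairs are unordered: the strength between a and b is stored once and found from either endpoint.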

boltzmann machine, node, state change, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
North America > United States > New York (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.93)
